The Nijmegen Corpus of Casual Spanish

نویسندگان

  • Francisco Torreira
  • Mirjam Ernestus
چکیده

Spanish is one of the best documented languages in the world. However, no large corpus of casual Spanish suitable for detailed phonetic analysis is available to our knowledge. The goal of this article is to introduce the Nijmegen Corpus of Casual Spanish (NCCSp from now on), a new corpus designed to fill this gap. The corpus was designed taking the Nijmegen Corpus Casual French as a model [Torreira et al., in press], which was also collected in our lab. The uniqueness of the NCCSp can be characterized as follows: • It contains around 30 hours of casual conversations among groups of friends. This makes it possible to study a wide range of phenomena characteristic of casual speech. • It contains speech from 52 native Madrid Spanish speakers sharing a similar educational background. • It contains large amounts of data for every speaker (around 90 minutes of recorded speech for every group of three speakers). This allows researchers to study withinspeaker variability. • It is orthographically annotated. • It contains video as well as audio data, which can be used by researchers interested in the use of facial and body gestures during verbal communication. The following sections provide a detailed description of the creation and transcription the NCCSp.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Weakening of Intervocalic /s/ in the Nijmegen Corpus of Casual Spanish

This study describes the weakening of intervocalic /s/ in the Nijmegen Corpus of Casual Spanish and investigates the role of several potential conditioning factors, including morphological, lexical and probabilistic variables. Three acoustic parameters were examined: voicing, the difference in high-band (4-8 kHz) intensity between /s/ and the following vowel, and the duration of the dip in low-...

متن کامل

The Nijmegen Corpus of Casual French

This article describes the preparation, recording and orthographic transcription of a new speech corpus, the Nijmegen Corpus of Casual French (NCCFr). The corpus contains a total of over 36 hours of recordings of 46 French speakers engaged in conversations with friends. Casual speech was elicited during three different parts, which together provided around ninety minutes of speech from every pa...

متن کامل

The Nijmegen Corpus of Casual Czech

This article introduces a new speech corpus, the Nijmegen Corpus of Casual Czech (NCCCz), which contains more than 30 hours of high-quality recordings of casual conversations in Common Czech, among ten groups of three male and ten groups of three female friends. All speakers were native speakers of Czech, raised in Prague or in the region of Central Bohemia, and were between 19 and 26 years old...

متن کامل

Cultural Influence on the Expression of Cathartic Conceptualization in English and Spanish: A Corpus-Based Analysis

This paper investigates the conceptualization of emotional release from a cognitive linguistics perspective (Cognitive Metaphor Theory). The metaphor weeping is a means of liberating contained emotions is grounded in universal embodied cognition and is reflected in linguistic expressions in English and Spanish. Lexicalization patterns which encapsulate this conceptualization i...

متن کامل

Impact of Irregular Pronunciation on Phonetic Segmentation of Nijmegen Corpus of Casual Czech

This paper describes the pilot study of phonetic segmentation applied to Nijmegen Corpus of Casual Czech (NCCCz). This corpus contains informal speech of strong spontaneous nature which influences the character of produced speech at various levels. This work is the part of wider research related to the analysis of pronunciation reduction in such informal speech. We present the analysis of the a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010